Efficient Discriminative Convolution Using Fisher Weight Map
نویسنده
چکیده
Convolutional neural networks (CNNs) have been studied for a long time, and recently gained increasingly more attention. Deep CNNs have especially achieved remarkably high performance on many visual recognition tasks due to their high levels of flexibility. However, since CNNs require numerous parameters to be tuned via iterative operations through layers, their computational cost is immense. Moreover, they often require a huge number of training samples and technical tricks, such as unsupervised pretraining and heuristic tuning, to successfully train the system. In this work, we present a very simple method of layer-wise convolution. We can obtain discriminative filters by using a Fisher weight map, which well separates convolved images between categories. This operation can be deterministically solved as a simple eigenvalue problem and no back propagation or hyper-parameters are needed. Because our method is layer-wise and based on a simple eigenvalue problem, it is computationally efficient. Also, it is relatively stable with a moderate amount of training samples and is capable of learning densely from high-dimensional descriptors without dropping connections between neurons, which is a common practice in conventional implementations of CNNs. We demonstrated the promising performance of our method in extensive experiments with two datasets. Our network used together with appropriate pooling and rectification techniques achieved remarkably high performance that was distinctly comparable to those that were state-of-the-art.
منابع مشابه
Phoneme recognition based on fisher weight map to higher-order local auto-correlation
In this paper, we propose a new feature extraction method based on higher-order local auto-correlation (HLAC) and Fisher weight map (FWM). Widely used MFCC features lack temporal dynamics. To solve this problem, 35 types of local auto-correlation features are computed within two-dimensional local regions. These local features are accumulated over more global regions by weighting high scores on ...
متن کاملTowards Generic Fitting using Discriminative Active Appearance Models Embedded on a Riemannian Manifold
A solution for Discriminative Active Appearance Models is proposed. The model consists in a set of descriptors which are covariances of multiple features evaluated over the neighborhood of the landmarks whose locations are governed by a Point Distribution Model (PDM). The covariance matrices are a special set of tensors that lie on a Riemannian manifold, which make it possible to measure the di...
متن کاملLearning Robust Deep Face Representation
With the development of convolution neural network, more and more researchers focus their attention on the advantage of CNN for face recognition task. In this paper, we propose a deep convolution network for learning a robust face representation. The deep convolution net is constructed by 4 convolution layers, 4 max pooling layers and 2 fully connected layers, which totally contains about 4M pa...
متن کاملSpeech Emotion Recognition Using Hybrid Generative and Discriminative Models
In this paper, we use Sequential Forward Selection to select 8 dimensional frame-level features from the total 69 dimensional features, and we reduce the dimensions of utterance-level eigenvectors from 63 to 12 by fisher discriminant. Then, two kinds of GMM multidimensional likelihoods are proposed for hybrid generative and discriminative models. Experimental results on Berlin emotional speech ...
متن کاملAn Application of the Convolution Inequality for the Fisher Information
A characterization of the normal distribution by a statistical independence on a linear transformation of two mutually independent random variables is proved by using the convolution inequality for the Fisher information.
متن کامل